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^ ■ Survey: Information flow on trees 

, ^ \ Elchanan Mossel 

m 

^\] ■ Abstract. Consider a tree network T, where each edge acts as an independent 

copy of a given channel M, and information is propagated from the root. For 
which T and Af does the configuration obtained at level n of T typically contain 

□^ , significant information on the root variable? 

^^ ■ This model appeared independently in biology, information theory, and 

statistical physics. Its analysis uses techniques from the theory of finite markov 
chains, statistics, statistical physics, information theory, cryptography and 
jrt ■ noisy computation. In this paper, we survey developments and challenges 

related to this problem. 



> 

■^ , 1. Introduction 

vQ ■ Consider a process on a tree in which information is transmitted from the root 

f^ I of the tree to all the nodes of the tree. Each node inherits information from its 

^^ ■ parent with some probability of error. The transmission process is assumed to have 

^P I identical distribution on all the edges, and different edges of the tree are assumed 

r^ ■ to act independently. 

As this process represents propagation of a genetic property from ancestor to its 
descendants, it was studied in genetics, see e.g. [71 I39j . In communication theory, 
this process represents a communication network on the tree where information is 
transmitted from the root of the tree. Earlier, the process was studied in statistical 
physics, see e.g. |41lll7l l5]. 

The basic question we address in this survey is: Does the configuration obtained 
C^ ' at level n of T typically contain significant information on the root variable? The 

theory of finite markov chains implies that if the underlying markov chain is ergodic 
(i.e. irreducible and aperiodic), then the variable at a single node at level n and the 
variable at the root are asymptotically independent as n — *■ oo. However, for the 
tree process, information is duplicated, so it is conceivable that the configuration 
at level n contains a significant amount of information on the root variable. 

In Section 121 a precise formulation of the problem is given. In Section |31 we dis- 
cuss the problem for symmetric binary channels (which correspond to Ising models 
with no external field) . This is the family of channels for which the most is known. 
In particular, is subsection 13.11 we compare various reconstruction algorithms for 
symmetric binary channels. 
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2 ELCHANAN MOSSEL 

Suppose that instead of the configuration at level n, we are given the census 
of the configuration at level n. In section 0] we see how the spectral properties of 
M determine if the census is asymptotically independent of the root. In Section [S] 
we present bounds for the problem for Potts models, while in Section we present 
some examples of channels which are related to secret sharing and demonstrate 
that level n may contain significant information on the root even if the census of 
the level contains no such information. 

In Section [7| we discuss related problems and terminology. Many unsolved 
problems are presented throughout the paper. 

Disclaimer: This paper is a survey: most of the results are presented without a 
proof; for others only a sketch is given. 



2. The reconstruction problem 

Definition of the process. Denote the underlying tree by T = {V,E). The 
information flow on each edge is given by a channel on a finite alphabet A — 
{1, . . . ,k}. Let Mi^j be the transition probability from i to j; M be the random 
function (or channel) which satisfies for all i and j that P[M{i) = j] = M^j-, and 
A2(M) be the eigenvalue of M which has the second largest absolute value (A2(M) is 
in general a complex number). We assume throughout the paper that M defines an 
ergodic markov chain (irreducible and aperiodic). At the root p one of the symbols 
of A is chosen according to some initial distribution. We denote this (random) 
symbol by a p. This symbol is then propagated in the tree in the following way. 
Given that the parent of v, denoted v' , has value ayi, the probability that a^ is j 
is given by M^ , j. More formally, for each vertex v having as a parent v', we let 
(jy = M„/_„((7„'), where the {M„/_„} are independent copies of M. Equivalently, for 
a vertex v, let v' be the parent of v, and let r{v) be the set of all vertices which are 
connected to p through paths which do not contain v. Then the process satisfies: 

PK == j\{<^w)wer{v)] = PK = j\<^v'] = M^^, J. 

Let d(, ) denote the graph-metric distance on T, and L„ ^ {v Cz V : d{p,v) — n} 
be the n'th level of the tree. For v G V and e = {v, w) E E we denote \v\ = d{p, v) 
and |e| = max{|t;|, |w|}. We denote by cr„ = {a{v))y^L„ the symbols at the n'th 
level of the tree. We let c„ — (c„(l), . . . , c„(fc)) where 

Cnii) = #{w e Ln ■■ a-{v) == i}. 

In other words, c„ is the census of the n'th level. Note that both {(Jn)^^i and 
(cn)j5°=i are markov chains. 

Reconstruction solvabiUty. For distributions P and Q on the same space, the 
total variation distance between P and Q is 

(1) i^y(P,Q) = i^|P(a)-Q(a)|. 

(7 

Definition 2.1. The reconstruction problem for T and M is solvable if there 
exist i,j G A for which 

(2) Mm DviPl,,Pi) > 0, 
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where P^ denotes the conditional distribution of an given that ap ^ £. We define 
census solvability similarly, where the measures P^ are replaced by measures P^, 
which are conditional distributions of Cn given that Op = L 

A stronger definition than Definition 12. II is obtained by replacing "there exists 
j, j" by "for all i, j" . We choose the weaker definition as we are interested to know if 
some information is propagated from the root to the boundary, see also Proposition 
O below. 

Equivalent definitions. If the reconstruction problem is solvable, then ct„ con- 
tains significant information on the root variable. This may be expressed in several 
equivalent ways. Assume that the variable at the root, ap, is chosen according to 
some initial distribution (7ri)ig^, and let P'^ denote the corresponding probability 
measure. The maximum-likelihood algorithm, which is the optimal reconstruction 
algorithm of ap given cr„, is successful with probability 

^"(tt) = V" P^'ifTn = cr] maxP''[crp == i\an = a] 

a 

> max^ P^[(T„ = (t]P'^[o'p = i|cr„ = cr] = maxTT,;. 

a 

Note that it is possible to reconstruct Op with probability max^ tt^ even when ct„ 
is unknown (using the algorithm which always reconstructs the i which maximizes 
TTi). It is therefore natural to consider A„(7r) — maxiTTi as a measurement for the 
dependency between (T„ and a p. 

Let E be the entropy function, and let /(X, Y) = H{X) + H{Y) - H{X, Y) be 
the mutual- information operator (see e.g. |8| for definitions and basic properties). 

For a sequence of random variables X„ defined on the same probability space, 
let Fn be the a-algebra defined by {Xm,)m>n, i-e., F„ is the minimal cr-algebra such 
that all the variables {Xm)m>n are measurable with respect to Fn. Let Foo — 
n^i-Fn- We say that the sequence Xn has a trivial tail, if all the measurable 
sets with respect to F^c have probability either or 1. Otherwise, we say the the 
sequence has a non-trivial tail. 

In the theory of markov random fields the notion of tail triviality is closely 
related to the extremality of the measure, see e.g. |14| . 

The following equivalence follows from the fact that a„ is a markov chain (see 

e.g. my 



Proposition 2.1. Let T be an infinite tree and M a channel. Then the fol- 
lowing conditions are equivalent (where n denotes initial distribution for ap): 

(1) The reconstruction problem is solvable 

(2) There exists a n for which lim„^tx) -f (cp, an) > 0. 

(3) If IT is the uniform distribution on A, then lim„^oo I{ap,an) > 0. 

(4) For any distribution tt with mini tTj > 0, it holds that lim„^oo -^("'pi o'n) > 
0. 

(5) There exists a tt for which liminf„^oo A„(7r) > max^ tt^. 

(6) If n is the uniform distribution on A, then lim inf „_^oo A„ (tt) > 1/|^|. 

(7) For all tt with miuiTTi > 0, {an)^=i has a non-trivial tail. 

(8) There exists a it with min, tt^ > such that (<t„)^]^ has a non-trivial tail. 

The analogous 8 conditions are equivalent for c„ . 
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3. The Ising model 

The only family of channels for which the problem is well understood is the 
family of symmetric binary channels 

,3) ^-{l-' \-. 

where \2{M), the second largest (in absolute value) eigen value of M, satisfies 
A2(M) — 1 — 2e. Channel Q corresponds to the Ising model on the tree. The 
free measure for the Ising model on a finite tree is the probability measure on 
configurations a of ±1, given by 

(4) P[cr] = - exp(^ cr„cr„), 



where Z is a normalizing constant. The correspondence between (O and Q) is 

cxp(-/3) 
oxp(~/3)+oxp(/3)' 



given by e = cxaf'^m+fiDffl^ ' °^ equivalently, A2(M) = tanh/? 



Theorem 3.1. The reconstruction problem is solvable for the binary symmetric 
channel with error probability e 0), and the b-ary treeTh, if and only ifb\\{M) > 1. 
If b\\{Ad) > 1, then the reconstruction problem is also census solvable. 

Reconstruction (and census) solvability when bX^iM) > 1 was first proved in 
|17j ( |23| is earlier and does much more, but is formulated in the language of 
multi-type branching processes). 

Proof. We think of the spin values as ±1. Write A = 1 — 2e, and let 5„ be 
the sum of the ± variables at level n of the tree. Given that the spin at the root is 
+, the expected value of S'„ satisfies 



(5) E+[5„]- J2 E+[<T.]=6"A 



n \ n 

111 [Ov\ — u 

veL„ 



Similarly, E [Sn] = —6" A". The second moment of S'„ satisfies 
(6) 

E+[Sl] = E-[Sl] = Y. EK^^] - &" ( 1 + J2^V - b>-^)\^^ I = 9(62" A^"), 

where the last equality follows from the fact that bX^ > 1. By Cauchy-Schwartz, 

E+[5„]-E-[5„] - ;^(P+[r7]-P-M)5„(a) 



It now follows by Q and © that when b\^ > 1, 

^ p+M + p-H ^ > 

which implies that I?y(P+, P") = 9(1). D 

The reconstruction solvability result when b\X2{M)\'^ > 1 is extended to general 

trees |12j and general channels |23LI30| . where b is replaced by the branching number 
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of the tree and A2 is the second largest eigenvalue of the matrix M (in absolute 
value). 

The proofs of the non-reconstruction result when bX^iM) < 1 are harder, and 
do not generalize to other channels. We know of 4 different proofs for this result 

• The first proof |6j , is based on recursive analysis of the Gibbs measure. 

• A proof of non-reconstruction which is based on information inequalities 
is given in 1125 where it is shown that the mutual information between the 
variable at the root of the tree and the level n variables satisfy /(cp, cr„) < 
X^uGL ^{^pi'^v)i ^s in the case of conditionally independent variables. 
This proof extends to general trees when br(r)A2(Af) < 1, where br(r) 
is the branching number of the tree. 

• In |18j recursive analysis is used in order to show E[E^[(Tp|iT„]] tends to 
zero as n ^ 00 for the n-level tree, when b\\{M) < 1. The proof |19) 
applies also to general trees when br(T)A2(M) < 1. 

• Glauber dynamics is the following reversible Monte-Carlo method for sam- 
pling configurations a according to the distribution JSJ or Q. Given the 
current configuration a, a vertex v is picked uniformly at random at rate 
1, in which case the variable a^ is replaced by a random variable a'^ chosen 
according to the conditional distribution on the rest of the configuration, 
io'w)w^v In Q] it is shown that Glauber dynamics have spectral gap 
which is bounded away from zero when bX^iM) < 1. Then using a gen- 
eral principle (which is proven in a much more general context) we obtain 
that the reconstruction problem is unsolvable when 6A|(Af) < 1. 

In j37| the critical case for general trees, br(r)A2(Af) = 1, is analyzed in detail. 



3.1. Reconstruction algorithms. Theorem lH.ll reveals a surprising phenom- 
enon: reconstruction by global majority vote has the same threshold for success as 
maximum likelihood reconstruction (which is the optimal reconstruction strategy) . 

The parsimony method is popular in biology. Given a bicoloring of the bound- 
ary of a tree T, a parsimonious coloring of the internal nodes is any assignment of 
the two colors to these nodes that minimizes the total number of bicolored edges. A 
way of finding a parsimonious coloring is the following: Starting from the parents 
of the boundary nodes, assign recursively to each internal node the color of the 
majority of its ±l-colored children. In case of a tie, assign the non-color "?" . Then 
scan the tree from the root downwards and assign all vertices labeled by ? the same 
label as their parent. 

On a fixed finite tree, when e > is small, the maximum likelihood algorithm 
will reconstruct the same root value as one of the parsimonious colorings given the 
boundary. 

However, this is not the case when e is larger. For the binary tree, it is shown in 
|42| that the parsimony reconstruction algorithm has success probability bounded 
away from 1/2 as n ^ 00 if and only if e > 1/8. Thus when A2(M) = 1 — 2e £ 
(2~^/^, 3/4] on the binary tree, majority (and maximum likelihood) will have success 
probability bounded away from 1/2, while the parsimony success probability tends 
to 1/2. 

On a tree where each vertex has k children with k odd, the above algorithm for 
finding a parsimonious coloring reduces to recursive majority; In |31| it is shown 
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that reconstruction via this method succeeds asymptotically if and only if 

1 2^ fk- 1^ "^ 



(it is interesting to note that the proof is based on exactly the same recursion which 
is analyzed in the context of noisy computation in |16l llUj l. 

In j31| more general reconstruction algorithms on regular trees (and more gen- 
erally, on f-periodic trees) are analyzed. Suppose that in order to determine the 
color assigned to a node w, the algorithm is allowed to examine the colors of its 
descendants i generations down. (However, only a single bit can be stored at each 
node). Then it is shown in |31| that recursively applying majority vote of the de- 
scendants H. generations down is optimal, yet it succeeds asymptotically only for flip 
probabilities e below a threshold which is strictly lower than the critical value for 
reconstruction. 

Yuval Peres (private communication) conjectured that 

Conjecture 1. Consider the Ising model on the regular tree Tb and recon- 
struction algorithms which are local: algorithms that for each vertex are allowed to 
scan the information stored at its descendants £ generations down, and at most r 
bits of information are allowed to be stored at each node. Then for all £ and r the 
threshold for reconstruction for such algorithms is strictly below the threshold for 
reconstruction. 

We remark that it is important to require that the algorithm examines only 
vertices below the vertex which is being updated, as Glauber dynamics are local 
and are successful in reconstruction of the root whenever bX^iM) > 1. A step 
of Glauber dynamics is performed as follows. Given the current configuration a, 
an internal vertex v is picked uniformly at random at rate 1, in which case the 
variable a^ is replaced by a random variable cr(, chosen according to the conditional 
distribution on the rest of the configuration, {crw)w=^v 

We emphasize that the fact that recursive algorithms are asymptotically inferior 
to global majority does not hold for other Potts models or Ising models with external 
fields (122] I see also Section 0). 

4. Census solvability 

The threshold bX^iM) = 1 which appeared as the threshold both for recon- 
struction solvability and for census solvability for the Ising model, turns out to be 
in general the threshold for census solvability. 

Theorem 4.1. Let M be a channel corresponding to an ergodic markov chain. 
Let Til be the b-ary tree. The reconstruction problem is census-solvable ifb\X2{M)\'^ > 
1, and is not census solvable if b\X2{M)\^ < 1. For general trees, the reconstruction 
problem is solvable when hr{T)\X2{M)\'^ > 1, where br(T) is the branching number 
of the tree. 

Conjecture 2. The reconstruction problem is not census solvable when 

b\X2iM)f = 1. 

6|A2(M)p > 1 implies census solvability. |23| proves a limit theorem for the 
variables c„. In particular it is shown, that if b\X2{M)\^ > 1 then the distribution 
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of the limiting variable depends on the initial variable at the root. This implies 
that the problem is census solvable. 

A more elementary proof is given in i3Ui. The proof follows the lines of the 
proof for the Ising model fTheorem l3.1|l . where 5„ is replaced by the scalar product 
of c„ with any vector nonzero v satisfying Mu — X2{M)v. Note that this proof 

generalizes the proof for the Ising model, since for the Ising model, v = [ 

This proof also generalizes to general trees, proving that if br(r) | A2 (M) p > 1, then 
it is possible to reconstruct the root using the scalar product of v with a weighted 
census c„(i) = {J2x ^{^) • ^ G -^ni cr^ = i} for some weights {lu{x)}x^t- 

6|A2(M)p < 1 implies no census solvability. The CLT in |23j implies 
that if b\X2{M)\'^ < 1 then the normalized value of c„ (c„/6"/2 if b\X2{M)\'^ < 1) 
converges to a nonzero random variable which is independent of the variable of 
the root. However, this result on docs not imply that the reconstruction problem 
is not census solvable. Presumably, it may the case that the first coordinate of 
Cn is more likely to be even for some value of the root variable than for others. 
This dependency between the root variable and c„ would not manifest itself in the 
limiting normalized variables. 

In .30j we combine the results of |23l with the local central limit theorem to 
demonstrate that this could not happen. The idea of the proof is to use |23) in 
order to couple c^, the value of c„ given that the root variable is i, and c:^, the value 
of c„ give that the root variable is j in such a way that the variables are close (i.e., 
cjj — c^loo < e6"^^). Then use the local central limit theorem in order to achieve a 
coupling of c^_|_f and c^^^ with high probability. In j30| we also verify Conjecture 
13 for Potts models and asymmetric Ising models. 



5. Potts models 

Two of the natural generalizations of binary symmetric channels are asymmetric 
binary channels (which correspond to Ising models with external field), and q-sny 
symmetric channels (which correspond to Potts models with no external field) : 

Asymmetric binary channels have the state space {0, 1} and the matrices: 



• 



(7) M 



l-5i Si 

1-52 S2 



with \2{M) =62-61. 

Symmetric channels on q symbols have the state space {1, . . . , 9} and the 

matrices: 



(8) M = 



/ l-{q-l)S S ... S \ 

6 l-{q-l)6 S ... 



\6 ... S l-{q-l)S J 

with A2(Af) = l-q6. 

Depending on the sign of A2 (M) we distinguish between ferromagnetic Potts models 
where A2(M) > 0, and anti-ferromagnetic models where A2(M) < 0. When 1 — 



(q-l)-l {q-iy 
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{q — 1)6 — 0, we obtain the model of proper colorings of the tree: 

f (g-l)-i (g-l)-i ... (g-l)-i \ 

(9) M = 

V (9-1)-' •■• (9-1)-' / 

Problem 1. For the 3 symbols Potts model ^ find the values for which the 
reconstruction problem is solvable on the b-ary tree. 

It may be easier to solve the analogous problem for the Ising model with exter- 
nal field. The analogous problem for colorings was stated in [3]- Applying standard 
coupon-collector estimates recursively, it is easy to see that if 6 > (1 + 5)q\ogq and 
q is large, then the reconstruction problem is solvable for the coloring model. 

Problem 2. For colorings, for which b and q is the reconstruction problem 
solvable on the b-ary tree? 

Below we discuss several bounds for the reconstruction problem for Potts mod- 
els. 

• If b\2{M) > 1 then the reconstruction problem is solvable. This 
follows from Theorem l4.1l and from the fact that census solvability implies 
solvability. 

• If 6|A2(Af) < 1, then the reconstruction problem is unsolvable. 

Proof. Assume first that M is a ferromagnetic Potts model, i.e. 
X2{M) > 0. Consider two measures on the tree, one with i as the root 
variable and one with j as the root variable. We couple these measures in 
the following way: starting at the root if the two measures agree on the 
variable at v, then we couple in such a way that the measures also agree for 
all the children of v. If they do not agree at v, then for each of the children 
of V, use the optimal coupling in order to couple the measures. For each 
of the children, this has success probability qS. Thus the non-coupled 
vertices are a branching process with parameter 1 ~ qS — A2(M). When 
bX2{M) < 1 this process will eventually die; this means that for large n 
all the vertices at level n will have the same variables with probability 
going to 1 as n — > cxd, as needed. When M is anti-ferromagnetic, the 
coupling probability is (g — 2)5 + 2(1 — {q — 1)S) = 2 — q5, therefore the 
branching process parameter is l — {2 — q6) = — A2(M) = |A2(M)|. Similar 
arguments apply for Ising models with external fields. D 

If 6A2(M) > 1 and q is sufficiently large, then the reconstruc- 
tion problem is solvable. This is the main result of 32 . It implies in 
particular that 6|A2(M)p = 1 is not the threshold for the reconstruction 
problem for Potts models, as it sometimes possible to reconstruct even 
when &|A2(M)|^ < 1. An analogous result is proven for the asymmetric 
binary channel. The idea behind the proof is the following. Channel © 
may be thought of in the following way: at each step the output is identi- 
cal to the input with probability A2(M), otherwise, the output is chosen 
uniformly among the q symbols. In particular if X2{M) > is fixed and 
q is very large, then if two of the children of a vertex in the &-ary tree 
Tf, have the same label, then with overwhelming probability, this is also 
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the label of their parent. Now suppose that q is large and there exists a 
copy of T2 C Tb such that all the vertices of T2 are labeled by i. Using 
a recursive argument we see that given this event, with large probability, 
the variable at the root is i. Moreover we show that when h\2{M) > C for 
some constant C > 1, such a tree exists with positive probability. There- 
fore, it is possible to reconstruct the root variable based on the existence 
of such a unicolored tree. In order to obtain the result for C = 1, we 
replace the unicolored T2 by a diluted unicolored T2. 

lih -^y'^L^ — ^ then the reconstruction problem is unsolvable. 
This and the analogous result for asymmetric binary channels are proven 
in |30j . we sketch the main idea of the proof below. 

Proof. In order to show that the reconstruction problem is unsolv- 
able, it suffices to show that given that the root value is or 1 with 
probability 1/2 each, it is asymptotically impossible to conclude from the 
variables at level n, if the root is or 1 . Suppose that in addition to the 
variables at level n, we are also given all the variables at all levels of the 
tree having variable j with j ^ {0, 1}. Since we are given more informa- 
tion, it is easier to reconstruct. The model where we are given this extra 
information is nothing but the symmetric binary channel with matrix 
/ i-(g-i)^ 8 \ 

\ l-(q-2)S l-^(q-2)S ) 

on the random tree which is obtained from the original tree by inde- 
pendently deleting an edge with probability (q — 2)(5 and retaining it with 
probability 1 — (g — 2)(5. The results of jl2| imply that the binary symmet- 
ric channel on a general tree T , the reconstruction problem is unsolvable 
if br(T)A2(Af ) < 1. For the branching process on the regular tree Tf,, the 
branching number is "typically" 6 (1 — (g — 2)^). The non-reconstruction 

criterion br(T)A2(A/) < 1, now translates to 6 -^_7'^^w- < 1. D 

In Figure 1 we draw several of the bounds, where 6 is a function of A = A2(Af). 
The area above the top curve, h}? = 1, is the area where (census) reconstruction 
is successful for all channels. The top curve is the critical curve for the symmetric 
binary channel: above it reconstruction is successful and below it, it fails. Below 

the second curve, A = ^ ~_^ ' , reconstruction fails for the g = 3 Potts model. 
Below the bottom curve 6A = 1, reconstruction fails for all Potts models. This 
curve is also the asymptotic critical curve as g — > 00. 




0.2 0.4 0.6 0. 



Figure 1. bounds for 6 as a function of A2(Af) 



10 ELCHANAN MOSSEL 

5.1. Algorithms. We want to point out that the proofs in j32| imply that 
for Potts models when q is large, reconstruction using recursive schemes has better 
threshold than any algorithm which uses only the census. We conjecture that the 
phenomenon that global majority achieves the same threshold for reconstruction as 
maximum likelihood occurs only for symmetric binary channels. 

Conjecture 3. Consider Potts models 0) for q > 3. Then there exist b and 5 
such that the reconstruction problem is solvable for the b-ary tree, yet bX^iM) < 1. 

Note that this conjecture implies in particular, that any algorithm which uses 
the census only is inferior to the optimal algorithm for Potts models. 

5.2. Monotonicity. For Potts models ((SJl, it is easy to see that if the recon- 
struction problem is solvable for q and e and q' < q, then the reconstruction problem 
is also solvable for q' and e. We expect that for fixed A = X2{M), reconstruction is 
easier when q is larger. 

Conjecture 4. Consider two symmetric channels Mi and M2 (as in ^) on 
qi and (72 symbols respectively, where qi < q2- If < A2(A/i) = A2(Af2) and the 
reconstruction problem is solvable for Mi, then it is also solvable for M2. 

This is obvious when 52 is a multiple of qi . Using the reconstruction criterion 
for the binary symmetric channel on 2 symbols, it is easy to prove the conjecture 
when qi = 2. 

6. General channels 

In this section we discuss techniques which apply to the reconstruction problem 
for general channels. 

6.1. Proving solvability. Spectral methods. Theorem 14.11 implies that 
when &A2(M) > 1 the reconstruction problem is census-solvable (and therefore 
solvable) for the 6-ary tree and the channel M. 

Recursive methods. Starting at the boundary of the tree, we may try to evaluate 
recursively the variable at each vertex of the tree. Assuming that the probability 
of reconstructing the value of a variable at distance n from the boundary is pn, we 
obtain recursive bounds onp„+i. Bounding these recursions, we may prove that the 
reconstruction problem is solvable. Although for symmetric binary channels, these 
methods always achieve worse thresholds than the spectral method |31j . in many 
other cases the recursive methods are superior. The first example which was already 
discussed in SectionElis that of the Potts model on q symbols where q is large. It is 
proven in |32) that if 6A2(Af) > 1 and q is sufficiently large then the reconstruction 
problem is solvable (the spectral method only applies when bX^iM) > 1). In |30) 
we note that a similar argument proves that for the coloring problem for large q, 
if 6 > (1 + e)q\ogq, then the reconstruction problem is solvable. In general this 
method combined with standard large deviation estimates (see e.g. |2]) imphes: 

Theorem 6.1. Let M be a channel, such that for all i,j £ A, there exists 
an £ such that M-i^i =/= ^j,e- Then when b is sufficiently large, the reconstruction 
problem is solvable for the tree T^ and M . 

This is proven in |30| where a general criterion is given to decide: does there 
exist for the channel M a number b such that the reconstruction problem is solvable 
for Tb and M. 
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Example 6.2. Let {Zi}°^i be an i.i.d. sequence of variables such that P[Zi = 
0] = I — P[Zi = 1] = p where < p < 1. Let h > I and consider the channel M 
defined by the niarkov chain Yi — (Zi, . . . , Zi^h)- Thus M has state space {0, 1}''+^ 
with the product {p, I — p) probability measure. It is easy to see that for the tree 
process, the variable (CTt,)|t,|<„ are independent of {<yv)\v\>n+hj <^^d therefore the 
reconstruction problem is not solvable for M . On the other hand, letting 

Yi = max {Z, = ■ • • = Z^+j = 1}. 

0<j<h 

It is easily seen that Yi defines a channel M on the space {0,...,/i}. Moreover, 
it is clear that for all £ the variables {2^i}i>^+/i+i and {Yi}i<ce are independent. 
Therefore {Yi}i>l-^-h,+l cind {Yi}i<ce are independent. It follows that the variables 
M'^^^{j) have the same distribution for all j . Thus Ta.nk(M.^''^^) = 1, and X2{M) = 
0. Writing M: 

(p p{l^p) p{l-pf ... {l^pf 
1 ... ... 



M = 



\ 







1 







V ... p \-p j 

if follows from Theorem W. 1\ that the reconstruction problem is solvable for M and T^ 
provided that b is sufficiently large. This is a generalization of a channel appearing 
in [•il\ : see also |26| . 

The example above demonstrates that it may be the case that for the markov 
chain corresponding to the channel AI, the states at times t and t + h are indepen- 
dent, yet, for the tree process, the reconstruction problem is solvable. In fact, a 
much stronger phenomenon occurs 

Theorem 6.3. Let b > 1 be an integer and T be the 2-level b-ary tree. There 
exists a channel M such that for any initial distribution, Op and uq are independent 
(where Up is the root label, and ag is the configuration at the leaves of the 2-level b- 
ary tree), yet when B is sufficiently large, the reconstruction problem for the channel 
M and the infinite B-ary tree Tb is solvable. 



The construction in 30 is motivated by work on secret-sharing protocols |4£ 
and applies Theorem l6.il We define the channel below. For the proof we refer the 
reader to 2S|- 

construction. Let ^ be a finite field with q > b-\-2 elements. Let xi, . . . , xt+i be 
a fixed set of non-zero elements of J-. We define a channel on the state space 

J^''[x]^{fix):f{x)eT[x],degf<b}. 

Given /, take / to be a uniform variable in the set {1, . . . , 6 + 1}, then take M{f) 
to be 5 e J-''[x\ chosen uniformly among the g's satisfying g[{)) = f{xi). Given the 
value of / at 6 of the points xi, . . . , Xb+l^ and for all a & J-, there exists a unique 
polynomial satisfying /(O) = a. This implies that b independent copies of the chain 
at / give no information on /. In j30j it is shown that when the number of copies 
is sufficiently large, information is retained so that the reconstruction problem is 
solvable. 
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6.2. Proving non-solvability. We have a few techniques for proving non- 
solvability. 

Spectral gap of Glauber dynamics. Let A(n) be the spectral gap of Glauber 
dynamics for the n-level &-ary tree. In J2] we prove a result for general graphs which 
implies for Tb the following: 

Theorem 6.4. Suppose that M is a channel sueh that Glauber dynamics satisfy 
inf„ A(n) > 0, then the mutual information between Op the root variable ofTb, and 
an, the variables at level n decays exponentially fast: I{ap,an) = 0(exp(— ri(n))). 
In particular, the reconstruction problem for the tree T^ is unsolvable. 

Problem 3. Does it hold for reversible AI and the b-ary tree Tb that I{cTp, cr„) = 
0(exp(— f7(n))) if and only if vainK{n) > 0? 

For the Ising model on trees, the answer to the problem is positive, see [2]. 

Recursive analysis of niELximuni likelihood. A direct approach to the recon- 
struction problem is to analyze the distribution of the (log) likelihood of the root 
variable given the boundary variable. This leads to an iteration of random vari- 
ables. The only case in which this iteration was analyzed is the symmetric binary 
channel ([SY]) where this approach yields an exact criterion for reconstruction for 
general trees. It is an interesting challenge to extended this technique to other 
channels. 

It may be easier to analyze these recursions for " robust" phase transitions which 
first appeared in |38| . Consider the usual reconstruction problem, but suppose that 
the data at the boundary is given with some additional noise. The proofs that if 
b\\{M) > 1 the reconstruction problem is (census) solvable are immune to this 
noise. However, this may not be the case for the reconstruction problem. Indeed, we 
suspect that adding this additional noise (assuming it is fixed but sufficiently strong) 
will shift the phase transition to the point 6A|(Af) = 1. A similar phenomenon was 
proven in 38 for the phase transition of uniqueness. For n and m, we denote by 
o'n,m the configuration which is obtained from (t„ by applying the random function 
M™ independently on each of the symbols in (t„ . We denote by P^ „ the conditional 
distribution of <Tn^m given that Up — £. We then 

Conjecture 5. For all M and b, such that b\2{M) < 1, there exists m such 
that for the b-ary tree 

sup lim i:iy(P;,„,P^„) = 0. 

7. Terminology and related problems 

7.1. Related problems. In this subsection several variants of the reconstruc- 
tion problem are discussed. Throughout the section we will assume that the variable 
at the root is chosen uniformly. By Proposition 12. II reconstruction solvability is 
equivalent to the fact that there exists S > such that for all n with probability at 
least S, the conditional distribution of (jp given an has total variation distance at 
least S from the uniform distribution. 

We may consider the following variants of the problem: 

• non-uniqueness of the Gibbs measure. The condition here is that 
for all n there exists an such that the distribution of ap given an has total 
variation distance at least 6 > from uniform. This is a weaker condition 
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than reconstruction solvability and it was studied in statistical physics 
for Ising and Potts models. In particular, the phase transition for these 
models is known, see 15'. 

• dismantlable graphs. Suppose that we require that for all i e A and 
for all n there exists a„ such that P[(Jp = i|crn] = 1- This requirement 
clearly fails for all trees if the matrix M satisfies M^j > for all i and 
J. Moreover, this property depends only on which of the entries of M 
are non-zero. Define a directed graph G on ^ such that {i,j) is an edge 
of G iff 'M.ij > 0. We claim that there exists for sufficiently large b a 
b'&ry tree T for which the requirement holds iff for all i ^ j the sets 
N{i) = {i : (ij) £ G} and N{j) ^ {£ : (jj) e G} satisfy 

(10) N{i) <t N{j). 

proof. If N{i) C N{j) then for all n > 1 there exists no (T„ such that 
given (T„ the value of <Tp is i with probability 1. On the other hand, if for 
all i 7^ j, it holds that N{i) (/_ N{j), then given i, consider the labeling 
(T„ of Tb for b > \A\, which is obtained in the following way: The root 
satisfies Up — i. Given ct^,, label the children of w, denoted wi, . . . ,Wb, in 
such a way that {cr^,. : 1 < i < 6} = {^, (cr„,^) G G}. For all n it now 
holds that given an the value of the root ap must be «. D 

If (ll()|l holds for all i ^ j, then for large b for 5 > Q fraction of the am 
it holds that P[crp = j|cr„] = 1 for some i which depends on ct„. This may 
be proved using a recursive argument similar to |3UI Theorem 2.1]. 

We may replace the above requirement by the requirement that there 
exists i £ A such that for all n there exists cr„ such that given (t„ the 
variable Cp satisfies ap ^ i (with probability 1). Defining G as in the 
previous case, the property holds for T^ for sufficiently large & iff G is 
dismantlable^ see U- 

• census. One may ask similar questions about the census. 

— Does there exists a &, such that for the tree Tf,, for all n there exists a 
census Cn such that the distribution of ap given c„ has total variation 
distance at least S from uniform! For some models (like ferromagnetic 
Ising and Potts models) this condition is equivalent to uniqueness of 
the Gibbs measure. For others (like colorings) it seems that these 
conditions are not equivalent (we do not know how to demonstrate 
it for colorings; Example 16.21 is an example of such model). 

— Does there exists a b, such that for the tree Tf, and all n there exists 
a census c„ such that P[ap ^ i|c„] = 1 for some i? It follows from 
■31)' Lemma 6.2] that such a b does not exist when M is ergodic. 

• phylogeny. Most of the biological research which is related to recon- 
struction is devoted to problems in which the underlying tree is unknown 
and the algorithm is supposed to find both the tree and the variables at 
the nodes of the tree, see e.g. I13L I15| . These problems seems to be 
quite hard to analyze; in particular, in some cases there is no well-defined 
probability space of trees. In a recent work |33l 134) we show that the re- 
construction of phylogenetic trees is closely related to the reconstruction 
problem. 

• Noisy computation. Von Neumann |43j proposed a model of computa- 
tion in noisy circuits where each gate computes correctly with probability 
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1 — e, The analysis of this model in |43LI36L|lOLIll| has many similarities 
to the analysis of the reconstruction problem for the symmetric binary 
channel, in [SJ 1181 1191 1121 I31j . However, we do not know of any formal 
relationship between the two models. 

7.2. Dictionary. As the reconstruction problem has been studied from dif- 
ferent perspectives, different terminology is often used for the same entities. We 
list below some terms and their translations. 

variable, also symbol, state (finite markov chains), label, letter, message (informa- 
tion theory), spin (statistical physics), color (combinatorics), genotype (biology), 
phenotype (biology). 

b-ary trees, also &-I- 1 regular trees (corresponding to the degree as a graph), and 
Bethe lattice (statistical physics). 

channel, the channel (random function) M corresponds to a stochastic matrix M 
such that Mi J- — P[M(i) — j]. In the statistical physics literature, when working 
with Ising and Potts models, it is common to work with the Hamiltonian: 

(11) ff((a„),eT) = ^/^a„+/3 J2 h-.='r^}- 

^ {v,w) edge of T 

The probability of a configuration {(7v}vgt is then 

(12) iexp(i?((a„)„eT)), 

where Z is a normalizing constant, known as the partition function (it is a function 
of H and of the tree T). The parameter 1/(3 is often referred to as the temperature. 
From Hll|l the matrix M is given by: 

^ ^ exp(/tj+/3(5{»^j-}) 
"'■' X;fexp(/if -K/3(5{j=£}) 

Some families of interest are: The Ising model with no external field, when |^| — 2 
and hi = h2 = 0; The Ising model with external field, where |^| = 2; and Potts 
models without external field where \A\ = q and all the h values are 0. 

The process on the infinite tree then corresponds to the Gibbs measure on that 
tree with the specification that the root distribution is uniform. 

In biology, the matrix M, is related to the mutation rate of the process, and it 
is usually assumed that M is a perturbation of the identity matrix. 

In combinatorics the popular model is proper colorings of the tree, where M is 
a. q X q matrix satisfying M^j- = Su^j\{q — 1)^^. This model is also referred to as 
the zero temperature anti-ferromagnetic Potts model, since M is obtained as the 
limit of the corresponding M for Potts models when f3 -^ —oo. 

reconstruction solvability. Corresponds in statistical physics to extremality of 
the above measure. In information theory, it is natural to express reconstruction 
solvability in term of decay of mutual information (see ProDOsition l2.1|l . 
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8. Very recent results 

Since the submitting this survey, a number of new results on reconstruction 
appeared. In |28| better bounds for the reconstruction problem for Potts and asym- 
metric binary channels were obtained. Essentially the same bounds for asymmetric 
binary channels were obtained independently in 27 . 28 provides a comprehen- 
sive analysis of the mixing rates of Glauber dynamics for Ising and Potts models 
under various boundary conditions and gives a positive answer to Problem 3. 

In 'SOi we analyze robust solvability. This may be thought of as reconstruction 
where the labels at the bottom level are further perturbed. We show that the 
threshold for robust reconstruction is given by 6|A2(A/)|^ = 1 as conjectured in 

m 

Finally, we note that the crucial role of the reconstruction problem in Phylogeny 
was recently demonstrated in |34j and |35) extending the results of |33| . 
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